Corelab Seminar
2016-2017
Yannis Chatzimichos
Automatic detection of database joinability risks
Abstract.
Google holds a large amount of data, spread across a large number of databases.
Some of these databases should not be joined, notably for privacy or regulatory reasons. This talk presents an in-house framework, based on sketching algorithms, that monitors for joinability risks in an automated and privacy-sensitive way at Google scale.
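
The abstract does not say which sketching algorithm the framework uses. As a rough illustration only, the following Python sketch uses MinHash, one well-known sketching technique, to estimate how much the value sets of two columns overlap without comparing the raw values directly. A high estimated overlap would suggest that the two columns could serve as a join key. The function names, the choice of MinHash, and the number of hash functions are all assumptions made for this example and are not taken from the talk.

```python
import hashlib


def _hash(value: str, seed: int) -> int:
    # Deterministic 64-bit hash; the seed simulates independent hash functions.
    data = f"{seed}:{value}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(values, num_hashes=128):
    # Hypothetical helper: summarize a column by the minimum hash of its
    # distinct values under each of num_hashes seeded hash functions.
    distinct = set(values)
    if not distinct:
        raise ValueError("cannot sketch an empty column")
    return [min(_hash(v, seed) for v in distinct) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    # The fraction of positions where the two signatures agree is an
    # estimate of the Jaccard similarity of the underlying value sets.
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


if __name__ == "__main__":
    # Two made-up columns sharing 600 of 1000 distinct identifiers,
    # so the true Jaccard similarity is 0.6.
    ids = [f"user{i}" for i in range(1000)]
    column_a = ids[:800]
    column_b = ids[200:]
    estimate = estimate_jaccard(
        minhash_signature(column_a), minhash_signature(column_b)
    )
    print(f"estimated Jaccard similarity: {estimate:.2f}")
```

In this toy setup each column is reduced to a fixed-size signature, so the comparison cost does not depend on column size. Whether hashed signatures alone meet a given privacy requirement is a separate question that this example does not address.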